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ABSTRACT 


The need for tools capable of handling non-stationarities in the spectral content of 
the data has been recognized as early as 1946. The Wigner-Ville Distribution (WD) has 
been extensively used since its introduction in 1948, but suffers from some associated 
problems (e.g., spectral cross-terms and requiring the use of analytic signals). An alter- 
native Distribution is proposed, which has its origin in the definition proposed by Page 
of “Instantaneous Power Spectrum” (IPS). Its characteristics are examined and, when 
pertinent, compared to the WD. It is shown to be less sensitive to the problems aflicting 
the WD, but provides less frequency resolution. The usefulness of a parametric (AR) 
version is investigated. Some typical test signals are examined, to demonstrate the per- 


formance and trade-offs of IPS and its parametric version. 
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l. INTRODUCTION 
A. THE PROBLEM 


The use of the Fourier Transform as a spectral description of signals is a concept 
whose usefulness is restricted to the class of stationary signals. Though mathematically 
elegant and convenient, Fourier decomposition of a signal can often mask the true 
spectrum, since the assumed basis-functions implicitly oppose any notion of time- 
dependency. 

When the spectral content of the signal changes with time, as is often the case tn 
fields such as Communications, Seismology or Speech Processing, more powerful tools 
are needed. The trme-dependency of the spectral content should be apparent and meas- 
urable through the use of a more general type of representation of the signal. Gabor 
[Ref. 1] proposed such a representation by introducing the concept of a spectral de- 
scription in a joint time-frequency plane. 

Since then, several attempts have been made [Refs. 2, 3, 4, 5] to derive a, or the, 
function capable of correctly describing the distribution of thë signal's energy in this 
plane. Short Time Fourier Analysts (STFA) has been widely used and ts regarded as a 
very convenient approximation to the “true” distribution. 

However, obtaining this distribution is an under-determined problem, since an in- 
finity of valid solutions is introduced by allowing non-stationary descriptions. For 
example, anv time signal can be represented as a time-varying DC component. lo 


illustrate, consider the following discrete time signal: 


x(n) = cos( 4) (1) 


Two equally valid representations, in the sense that each one 1s capable of generat- 
ing the observed signal, are shown in Figures | and 2. 

Each one of these descriptions explains the existence of a sinusoidal time-series. 
Though there is no conceptual distinction between them, the algorithm of analysts must 
have the ability to decide which one will be taken as the “true” one, based on an ex- 
plicitly or implicitly built-in set of rules. The situation thus arises, where “one will see 


what one wants to see”. 





Figure 1. x(n) represented as a constant sinusoid 





Figure 2. x(n) represented as a time-varying DC component 


Further complications appear when we consider the fact that, given a time-series, 
we cannot determine if the underlying process is or is not stationary, without using ar- 
bitrary and often inappropriate assumptions about what is the local behavior. When the 
algorithm of analysis produces a time-varying spectrum, the fact remains that there is 
at least one equally valid time-invariant spectral description (the Fourier Transform 


based), which is arbitrarily ruled out. 


B. APPROACHES 

If the signal is to be represented in the joint time-frequency plane in a sensible 
manner, the distribution must have some key properties. For example, a shift in the 
time-series should always imply a corresponding shift of the spectral representation 


along the time-axis. Also, a multiplication of the time-series by a complex exponential 


should result into a shift of the spectral representation along the frequency axis. Without 
these properties. physical interpretation of the representation can be an impossible goal. 
This 1s one of the reasons why Ambiguity functions are not used in spectral] estimation. 

This technique of imposing constraints that are felt needed in a distribution can be 
carried further, and the distribution (or one of the distributions) formed in this refine- 
ment process defined as the true distribution. One of the most successful representations 
obtained by this approach is the Wigner-Ville Distribution (WD), which has extensively 
been used since its introduction in 1948 [Ref. 2). 

A different approach to solve the indeterminacy is to define, a priori and unambig- 
uously, what the considered true distribution is. Though this approach lacks control of 
the properties of the resulting distribution, it has the advantage of more closely pre- 
serving physical meaning in whatever results it produces. A typical example is the 
definition of “Instantaneous Power Spectrum” proposed by Page [Ref. 5], uniquely 


determining a resulting distribution which is amenable to physical interpretation. 


C. OBJECTIVES 

The Wigner-Ville Distribution has been extensively studied [Refs. 6, 7, 8. 9 ]. and its 
characteristics are fairly well understood. However, its use is still hampered by some 
problems. First. it has uneven performance for different classes of signals. It performs 
optimally for single-component linear FM (linear chirp), but has worse performance for 
less regular spectral dvnamucs. Also, when multi-component signals are present, the WD 
creates artifacts in the spectrum, ving mid-way between true components. These 
artifacts are. up to some degree, recognizable and treatable due to an alternating sign 
(Ref. 6]. but can severelv mask results when analyzing more complex signals. The use 
of analytic signals is usually required, not only to avoid the need for sampling at twice 
the Nyquist rate, but also to prevent the appearance of undesirable artifacts that would 
otherwise be created bv the interference between positive and negative frequencies 
Wels. 6, 7. 10]. 

Though it has not received proper attention in the literature, Page’s Instantaneous 
Power Spectrum is. once properly understood and treated, a practical alternative to es- 
timate the time-varving spectrum. 


The study and development of such an alternative is the main goal of this thesis. 


H. INSTANTANEOUS POWER SPECTRUM (IPS) 


A. DEFINITION 

In an attempt to accommodate the notion of instantaneous frequency content, Page 
defined the Instantaneous Power Spectrum as the derivative of a running energy spec- 
trum [Ref ab]: 


E Ó 2 
pS 0 (2) 
where 
I 
Ssi=1 «ve “Far. (3) 


That is, the Instantaneous Power Spectrum was defined as being, at each frequency, 
the rate of change of the energy collected by a Fourier transform taken from — oo . up 
to the time of analvsis. This concept was later extended by Levin [Ref. 3]. who intro- 


duced a complementaris backward run, similarly defined as: 


= 
Ct 





pr. = | STL (4) 


where 


Si) = | (year, (5) 
I 


and defining the Instantaneous Power Spectrum as the average of these two runs: 


IPS, N= P e A o t 0). (6) 


This expression can be put in the form [Refs. 3, 11 ] 
IPS(t, f) = Real [s ()S(fe?™7] (7) 


where S(/) 1s the Fourier Transform of s(z). 

IPS can thus be seen to be the real part of Rihaczek's distribution [Ref. 4], which 
allows one to think of IPS(t,f) as being the energy density entering, at time +, an infi- 
nitely narrow filter centered at frequency f. An enlightening treatment of tais point can 
be found in Ackrovd [Ref. 12]. 

l. Time Domain 

Let us now consider the following: we want a function of i,t whose Fourier 
Transform is IPS(t,f) . That is, we want G(r, T) such that 


oO 


IPS(t, f) = | Gli, pe P] dr. (8) 
Then, using (7) 
Ge jet | Es(0S (e PLA so) SN e? af (9) 


from where. with the appropriate variable substitutions we obtain 


o) = | o sU-— mean + co | s(t + nein Far. (10) 
Interchanging the order of integration, we get 
a 5 Esto) (= D+ s (st + +2). (11) 


IPS can now be restated as: 


OO 


IPS(1,f) = + | Es()s U—D)+s (Os(t+ De Fa. (12) 


Frequency domain 
Alternatively, a dual expression can be derived, expressing IPS in terms of the 


to 


Fourier Transform of the signal. 


OS So 


Le Stee) = + | s(ts (1 sat re PV az E > | s (t)s(t 4 re PV dr 


a 
O 


-f E S(u) al s “(ye Pa or US E 
af f S oak Sy, e 


a (13) 


oe 


fe 


-f fs SUIS yA e Sly — fdydu + 


l f -j?r 2 my! 
4| | ws) RT PS) — payda 


co oO 


I * — nf + En IPAS 
-+| SDI Ga a ii 4| SD gp. 


-00 


-00 


With the appropriate variable substitutions, we get 


oO OO 


IPS(t, fet | SQ S eP ay + > | SENS" dy 


—00 00 


and, finallv 


oO 


IPS, f) = > | [SIS (f-y) + SNS + y] edy. 


3. IPS for discrete signals 


The discrete version of IPS follows directly from ( 12 ) as 


A 


RO = > [s(n)s (2 —k) + (ms(n + Je E, 
EA 





A formal derivation of ( 16 ) can be found in Appendix A. 


B. RELATIONS WITH OTHER TIME-FREQUENCY DISTRIBUTIONS 
l. Rihaczek’s Distribution 


(14) 


(15) 


(16) 


The Rihaczek Distribution was proposed in 1968 as the true energy! distribution 


in an attempt to unifv existing results. 


Formally, the distribution is [Ref. 4] 


e(n = SNS (Ne A 


where s(t) is the analytic signal being analyzed, and S(f) is its Fourier Transform. 


(17) 


As was derived. c(t,. fo) is the complex energy density at the point (4. /,) in the 


time-frequency plane. A similar expression had been considered by Levin, who defined 


the Complex Instantaneous Power Spectrum as the complex conjugate of ( 17). 


* Rihaczek used the concept of “complex energy”, whose real part is the real energy, while the 


imaginary part is the reactive energy. 


Using Levin's notation, the Complex Instantaneous Power Spectrum is 
> e ofi 
A(t, =s DSe”, (18) 


where s(1) 1s the complex envelope of the real signal under analysis. 
We thus see from ( 7) that, though addressing a broader class of signals, IPS 
is both the real part of Rihaczek’s Distribution and the real part of Levin's Complex 


Instantaneous Power Spectrum. Formally, 
IPS(t, f) = Realfe(r, A]. (19) 


2. Cohen’s Generalized Phase-Space Distribution Functions 
In 1966, Cohen introduced a generalized class of time-frequency signal repres- 


entations, given by 


c(t, =| | | D(v, 7) s(t, o ls = AFAN ddr (20) 


where the choice of D(v, 7) will determine the resulting distribution [Ref. 13). 


If we slightly rearrange ( 20 ) to read 


OO 


c(t, p= | | Dv, o| | s(t, sa = peito tias, (21) 


O 


We can interpret 1t as the double Fourier Transform of V({v, 7) [Ref. 14]. where 


oO 


YM, t) = D(o, af s(t + a )s (t — 3 el Ede (22) 
We can now recognize ( 22 ) as a general expression for the well known Amb1- 


guitv function, which can serve as a general definition of the time/frequency 


autocorrelation function [Refs. 3, 4 ]. 


Each choice of weighting function D(v, 7) in ( 20 ) provides a different distrib- 


ution, because each one defines the combined autocorrelation ( 22 ) in a somewhat dif- 
ferent way. 


Let us see which distributions result from three different realizations of Q(v, 7) 
(complex exponential, constant, and real sinusoid) [Refs. 4, 13, 14 ]. 


a. Complex exponential 





mus 
D(v,t) =e 2 (23) 
The combined autocorrelation becomes 
ON 
Yo =P" | s(t, + a )s (4 — F La 
O 
e 
a ie) 
= | s(r, + — sr = je ene adr 
=00 
and. defining a new variable 1 = 1, + DE we obtain 
oO 
Yo.) = | sups — ted, « (25) 


The resulting distribution is 


(ts (ty — tt gO dido, (26) 


Cie s(t,)s (1; —T) Y, — dt, dt 


s(t)s (1 — re Vr 


OS 


(NS (e PA 


which is the well-known Rihaczek Distribution. 


(27) 


Though this definition of the combined autocorrelation is probably the 


most intuitive one, its lack of mathematical symmetry has the undesirable effect of 


making the distribution complex. It has extensively been used in radar theory, since 


( 25 ) is exactly Woodward's definition of the Ambiguity function [Ref. 3]. 


b. Constant 
D(v, T)= 
The combined autocorrelation becomes 


Po, )=| tn+5) (h>5 edi - 


The resulting distribution 1s 


c(t, f) = | | s(t, + > Ii a jolene PROD oy dedo , 


-0 Oa 


(25) 


c(t, f= s(t, +3) 0 TE e PIS, — di dr 
Res (31) 
E | E] rd dr 


which is the Wigner-Ville Distribution. This definition of the combined autocorrelation, 
though only slightly different from the one in ( 25 ), has the needed symmetry to make 
this distribution real, and mathematically very convenient (when dealing with analytic 
signals). 

c. Cosine function 


D(v, 7) = cos(zut) (32) 


The combined autocorrelation becomes 


oc 


Vo, 7) = cos(zvt) | s(t, + = )s (t, — = Jr, 


oO 
C Tm Tdi 
oe i ae s(t) + >)s de ar 
i ça (33) 
| a E 2nu(t, + 
-+ s(n+5)s (1 Te A po 
—00 


+ S(t +> )s ( e A oo Pas | 


a ; o À Ts 
Substituting ¢ for 1, + > in the first integral, and : for t, — > in the second, 


E 


Y(v, t) = 5 | Os (1— Je dt + | s (t)s(t + ema 
= 5 sos (++ 0st+ y] Aa. 
The resulting distribution 1s 
c(t. f) f i E L BOD Ir T Ie PRON dodt 
= | | > [s(ty)s (4, = 7) + s (t,)s(t, + t) le * (0, — t)dt,dt (35) 


to |— 


[sios =a F 7) je 2 Ydr 


Which is the expression we had found earlier for IPS. In this case, the definition of the 
combined autocorrelation also possesses the symmetry needed for a real distribution. 
The kernel that generates IPS ( (0, t) = cos(zpz) )) has been considered by Conem 
[Ref. 13], and shown to generate the Margeneau-Hill distribution [Ref. 15], well known 
in quantum mechanics theorv. 
3. Wigner-Ville Distribution 

Since the Wigner-Ville Distribution (WD) has enjoyed wide acceptance, we will 
define more closely the relations between IPS and WD. As an indirect connection, we 
can relate both to the Rihaczek Distribution. From ( 29 ) the WD implicitly defines the 


combined autocorrelation as [Ref. 4] 


¥(v, 1) = | sy +) a Aa 


= (36) 
Hence, the Wigner-Ville Distribution is given by [Ref. 4] 
WD. f) = nde] s(11)s (4, — pena | (37) 


where £, .[. ] denotes the 2-D Fourier transform. From the convolution property of this 


operator, and using ( 25 ), 


IV D(t, f w INe i] k*k rd | s(1)s (1) _ eta, 
— 50 (38) 
= | ** e(r, f) 
ES AE dede e(t, f 


where ** stands for 2-D convolution. and e(:, f) is the Rihaczek Distribution. From 


( 38 ), using ( 19 ) and the realness property of WD (see Section II-C), we obtain 
Dir = Reall eI kt ef, N| 
IES N = Real ol, f) ** el, f]. 


(39) 


We thus see how the two distributions can be obtained from the Rihaczek Distribution 


by an appropriate choice of the function convolved with e(7, f). 


13 


We found earlier that each choice of Y(v, 7) in Cohen’s class of Distributions 


( 20 ) yields a particular definition of the combined autocorrelation function 


co 


Yv, 7) = Ov, of s(t; + = js (4 — = e d (40) 


OO 


and that the 2-D Fourier transform of ( 40 ) resulted in a particular distribution, de- 
pending on D(v, 7). 

We see now that ( 40 ) is the product of two functions, D(v, 7) and the integral. 
Its double Fourier Transform is thus a double convolution in the transform 
domain.[Ref. 14] 

That 1s: 


A AA 


oO 


= FO a s(t, + a soe E eta, | 
na 2 (41) 
=D og ae {| A eta 
= o Ra Ps 
In the particular case of IPS, 
IPSE N= A eosto] TAT (42) 
Therefore: 
IPS, fy =cosGnye A (43) 


which we could have deduced directly from ( 39 ) and the fact that the WD is always 
real, as follows: 


IPS(t, f= Realle(z, $ 


» 44 
= — Lele, A) + ele A). 


Using ( 38 ), 


[PYV DA, pr READ, NN] 


A /) (45) 


= cos(4zfr)."* WD(r, f). 


At this point, we would like to be able to express the WD in terms of IPS, with 


a relation of the form: 


WD(t, fy = O[IPS(t, fy], (46) 


Where O[.] is an arbitrary operator. Unfortunately, the inverse filter does not exist, 


l | 
Fa cespe | ne 


is not defined over the required range of v and 7. 


since 


4. Short-Time Fourier Transform 
Though the Short- Time Fourier Transform (STFT) can not be considered a true 
Time-Frequency Distribution [Ref. 14], its widespread use fully justifies the study of how 
it relates to IPS. This hs been addressed in [Ref. 11], and will be done here. after which 


some indirect results will emerge. Let us thus consider the STFT as defined in ( 48 ): 


STFT(t, w) = | F(t, w)| i 
2 


= | s(t + 1)w (1) dr 
we (48) 


= | | s(t + t)s (1 > vw (t)w(vje te drdv 


=00 —00 


Substituting s(¢ + 7) and w*(t) by their Fourier definitions, we have that 


oO oO 


STFT(t, al | cursos | | WU ne Pape e“ dido 
2 | | s (t+ v)w(v)S(p + w)W (ptr! dodo (49) 


-| | s (r + 0)S(p + eE wo) pje "dude . 


We thus see that the STFT is a time-frequency smoothed version of the Rihaczek dis- 
tribution of the signal. The smoothing function is the Rihaczek distribution of the 
window used to compute the STFT. Since the right-hand side of { 49 ) 1s always real and 
positive, and the previous relations are valid for arbitrary signals and windows, we con- 
clude that windowing a Rihaczek distribution with a Rihaczek distribution guarantees a 
real and everywhere positive distribution. This result will be important when addressing 


the issue of positivity. 


C. BASIC PROPERTIES OF IPS 

As Was already mentioned, and 1s easily seen from ( 20 ), it 1s the choice of a par- 
ticular P(v, 7) that determines the particular distribution and, hence, its properties. It 1s 
thus desirable to establish direct connections between the properties of D(v, T) and the 
properties of the resulting distribution. This issue has been addressed [Ref. 14]. and we 
will in the most part only state results, as applicable to IPS. The basic properties of IPS 
are: 


l. Time shift 
li Of) SSIS 
IPS (t, f) = IPS ft — tof). (50) 
2. Modulation 
If w(t) = s(e2%'", then 
IPS (1, f) = IPS (t,.f— fo) - (51) 
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3. Marginal in Time 


GO 


| IPS, 9) df = | s(u) [É (52) 
4. Marginal in Frequency 
| IPS(t, f) dt =} S}. (53) 
uice S) = Fsi]. 
5. Realness 
S Scala (54) 
6. Instantaneous Frequency? 
| SIPS, Naf 
aang alll = f(t) (instantaneous frequency) (55) 
SU 
7. Group Delay 
| LPS 
er) (36) 
SAP é 


$. Zero Power 
(1) =0 = IPS(tp, f) =0 
SU) =0 = IPS(t, fo) = 0 


The Wigner-Ville Distribution also respects properties 1-7 [Ref. 6], but only a 


weaker version of property 8. 


D. FURTHER PROPERTIES 
Though the properties presented in Section II-C constitute the basic set of rules to 
which IPS is bound. some further analysis must be made. The effects of linear operators 


should be investigated, to fully understand the behavior of IPS. 


É This property is valid for analytic signals only ( see Appendix B ). 


1. Windowing in the time-domain 
The issue here is to determine how IPS 1s affected by windowing the original 
time signal, 


We will at this point introduce the notation: 


IPS Lt, f) == 1PS¿(, +3 1PSI Y) (58) 
where 
IPSZ(t, f) -| dd (t — rje de (59) 
and 
ESO -| d ()d(t+ rede . (60) 
If the signal is windowed, that 1s: 
A(t) = s(t)w(t) , (61) 


then the IPS of the windowed signal will be 


CO 


IPSAtp=t | Cdja’(e—1) +d (Nate — DjeV de 
j (62) 
- 5 somos — vw (t — T) + 


-00 


+5 (thw (Dsl + iwl + Je dz. 


Separating the two terms, 


Co 


IPS jt, = > | Es(Os (1 — Ifu(ow' (1 — 1)Je dr 


—00 


OO 


+> | [s(t + Ie (ut + je Pas 


00 


PER l E - F =x7 —jlnfx 4 
A re (63) 


oO 


—] — 1 E 
++) F [IPS*]F [Psi Aa 


SS 


oo oe 


| IPS>(t, v)IPS_ (1, f— v)dv + | IPS*(1, v)IPS*(t, f— v)ao , 


— 0 —>90 


a 


Sl E A ; 
where F [.] 1s the inverse Fourier Transform operators Hence 


IPS fi. f=s| IPS DPS za | + pers, (e f$ IPS% Dl. (64) 


where A denotes convolution in the f variable. 

That 15, while for the Wigner-Ville. windowing the time signal implies the con- 
volution along the frequency axis of the signal's WD with the window's WD [Ref. 6]. 
for IPS we have a sum of two convolutions: the first term of the signal's IPS (1PS;) is 
convolved with the first term of the window’s IPS (JPSz), and a similar convolution is 
applied to (1PS;) and (1PS;) . 

A special case occurs when the window, the signal, or both, possess conjugate 
Cmt Out ME the ume at whieh@ we are computing IPS ( Le., 


s(t+t)=s(t—1), w+ t) = w(t —1), or both ). If w(z) possesses that symmetry, then 


IPS, = IPS, = IPS,(1, f) (63) 
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and. by linearity of the convolution operator, ( 64 ) becomes 


OO 


IPS At, f -| IPS.(1, v)IPS,(t, f—v)dv. (66) 


00 


We thus see that in this case IPS and WD respond similarly to the windowing 
of the data: the IPS of the windowed signal is the convolution (in frequency) of the IPS 
of the signal with the IPS of the window. 

In summary, the IPS of a windowed signal is a frequency-smoothed version of 
the IPS of the unwindowed signal. The time-resolution is not affected. 

2. Convolution in the Time-domain 
The question now is: how is the IPS of a signal affected if the signal is pre- 


processed with a filtering operation? So, let 
dt) = s(thehy). (67) 
Then. 


IPS At. f) = Real [Es (0) + A] FTC) + AI eP] 
= Real [Es (1) + KOISA A] (68) 
= Real Esse ; De (HD 2] 


where we used the fact that 


eI a(t) * b(t)] = | a(t)b(t — t)dt 


“—O00 


oo 


= | a(t) bu RA 


=00 


(69) 


= [a(9e2%Y + [0977 . 


We thus see that (contrary to what happens in the WD? ) filtering the signal 
does not, in gencral. correspond to thë convolution in tinie of the signal's IPS with the 
IPS of the impulse response of the filter. What is observable from ( 68 ) 1s that the IPS 
of the filtered signal is now the real part of the convolution, along the time axis, of the 
two Rihaczek Distributions. The net effect produced on IPS is, thus, still a smoothing 
operation along the time axis. The frequency resolution is not affected. 

3. IPS and Moyal’s Formula 
As 1s often stated in the literature, IPS does not, in general, respect Moyal's 


formula (Ref. 16]. When applied to the WD, Moval's formula states [Ref. 17]: 


ED MOA /\aidl= < seg > sg > Í (70) 


where < sg > is the inner product of sand g. Let us now see what happens in the IPS 


Case. 


IPS At, fIPS.(t, fdidf = 


[sts + sos + Je Par (71) 


AS Al ale 0, = 


oO 


[e(e (r—v) +2 (Nett v)Je a |diaf. 


8.2) 


* The WD of a filtered signal is the convolution along the time-axis of the onginal signal's 
WD with the WD of the impulse response of the filter [Ref. 6]. 
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oO OO 


| | IPS(1, f)IPS,U, f draf = 


E 


E | | | Es(D)s (1 — 2) + 5 (s(t + DILg(Dg (1 — 0) + 8 (Det + 0) 


=00 Q00 =00 


oo 


| e PIM Idrdtdv 
a (72) 


oO oO 


+| | Es(Os (1— 1) + s (esl + DILglDg (+ 0) +g (Deli — )] di de 


Se es: 


| 


5 |— 


* ] * 
<S, g> SS + — <S 27 <S.g> 


j 


oO So 


+ Real | «os | s(1- De (1 + dr dt |, 


—o0 —00 


and, finally 


on 


| | IPS¿(t, f)IPS,(t. f)dtaf = 


o (73) 


= > < S, g> SuSe > Tr 5 Real | s(t)g(2) | S(t = Del + t)dt dt 


—O00 SO 


The result confirms that Moyal’s formula is not, in general, valid for IPS. Only 
in the trivial case of one of the functions being a real constant would the second term 
in the right hand-side of ( 73 ) reduce to 5 <52> <s,2 >`, and hence would IFS oter 
Moval's formula. 


ze 


4. Recovery of time-signals 
To address the problem of recovering a signal from its IPS, we will find it con- 
venient to consider two separate cases. 
a. Infinite duration signals 
For real signals with S(0) #0, recovery of the signal is always possible ex- 


cept for a sign indeterminacy, by considering 


[PS(t, 0) = Real[s(1)S(0)] 


= S(0)s(1) = 


and, since | S(O) | can be found by using Property 4 ( 53 ), onlv the sign of S(0) remains 
undetermined. No similar results are available for complex signals of infinite duration. 
However, 1f s(1) possesses conjugate symmetry around some /,. then we can also recover 
complex signals of infinite duration up to a phase term, bv considering F-' [ /PS(4.f) J. 
Similarly, if S(f) possesses conjugate symmetry around some fp and S(0) #0, we can 
again recover complex signals of infinite duration up to a phase term, by considering 
TES Jo) ). 
b. Finite duration signals 

For finite duration signals, recovery of real or complex signals is always 

possible except for a constant phase term (or a sign, 1n the case of real signals). 


Let us denote the starting time bv r= 9. From ({ 12) 
F TIPS. M= LOS (0) +5 (050) ] (75) 
since s(—7)=0 for 7>0, 
FU TIPSO. Mloo=+s (Oír) (76) 


which, by property 3 ( 52 ), leaves an unknown phase term. 


E. IPS - A GENERALIZATION OF THE WIENER-KHINCHIN THEOREM 
By redefining auto-correlation, IPS offers a very straightforward extension of the 
Wiener-Khinchin theorem to the non-stationary case, as we shall see. 


Let us define the auto-correlation function of a signal as 


Rig 1) = 


Te 


Eis(os (1 —t)+ s (1)s(t + t)} (77) 
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We have, then, using ( 12 ) and by linearity of the convolution operator the follow- 


ing pair of relations: 


oc 


© EIPS(1, f}= | R(t, t) edt 


-00 


oo 


o R(t, t) | EIPS E cura 
These relations can be read as a generalization of the Wiener-Khinchin theorem to 
non-stationary signals, since for stationary signals they promptly reduce to that theorem. 


If s(z) 1s a stationary signal, then 


R(t, t) = + E}s(t)s (1 OS T t)} 
= = Els()s (1-3) + 5 Els (0s(1 +2) a 
= R0)+>R(-2) 
= R(z) 
EIPS.. fA} = | Rije Far =| S() | (79) 


The expected value of IPS 1s, thus, for stationary signals, the usual Power Spectral 
Densi (PSD) 

Since in the non-stationary case we cannot try to infer ensemble averages from time 
averages, the IPS of a signal is one realization of the generalized Wiener-Khinchin re- 
lations of the underlving stochastic process. 

A different view of IPS when applied to stochastic processes can be found in Grace 
[Ref. 11]. where E{JPS(z, 9) is shown to be the Fourier Transform of Loeve's General- 


ized Power Spectral Density. 
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MI. RETHINKING IPS 
A. ON THE USE OF WINDOWS 


IPS, as given in ( 12 ), suffers from intense ringing at all frequencies “touched” by 


the signal. This same effect is one of the reasons why Rihaczek’s Distribution did not 


find much acceptance. An example can be seen in Figure 3, where a linear chirp was 
used as the test signal. 





Figure 3. IPS for a linear chirp 


This ringing effect can easily be understood by slightly rearranging ( 12 ) to: 


OO co 


LC = 5 E suo Pedr +s (| set peas | (SO) 


—00 —oo 


Zs 


With the appropriate changes of variable, we get 


oo oo 


IPS(t, fy = + of Ss (te PA Dar + “| s(t)e! AE vao | 


—00 SS (81) 
[s(t) Esto) + PT + S~) Este) + 2%] 


eal [ s (D [s(t) « 2% ] 


But s(t) » e2'* is the output of a filter whose impulse response is a complex exponential. 


IPS is, thus, for each frequency, the real part of the product of the signal's complex 


conjugate and the output of an infinitely narrow non-causal filter (non-causal oscillator) 
(Figure 4). (Ref. 12] 





Figure 4. Model of 1PS 


The ringing is hence unavoidable, and will persist until the filter's impulse response be- 
comes negligible. We should consider stable filters, with decaying impulse responses, if 
We Want to diminish this effect. This reasoning leaves us very close to the work of Fano 
[Ref. 18], which was later extended by Schroeder and Atal {Ref. 19]. 
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If we replace the oscillators in the IPS definition by stable filters with impulse re- 
sponses of the form 


h(t) = w(ne?™ , (82) 


where w(/) 1s any real and symmetric function of ¢ with general lowpass characteristics 
and w(0) = 1, then the resulting distribution D(z, f) is given by 


Ds(1, f) = Real [s ()[s(2) * w(e?™"J] 


E E Esso * we YY + Esto * we?) 
: B (83) 


sls (wli = ne RA + > s (t)s(v) w(t — ve? dy, 


| 
‘ale 


=) IO 


Substituting 4, for (1— 7) in the first integral, and —1, for (1 — uv) in the second integral, 
We get 


oO 


oy) = + Es(o)s (1 = ipa s (1)s(1 + 11)] w(1,) A ; 
o (84) 


— OD 


= PS ¿y lts Sf) 


Where y(7) = u(t —1). 

That is. the resulting distribution is the IPS of the windowed signal, where the win- 
dow 1s the envelope of the filter impulse response. centered at the time of analvsis. Since 
(1 — t) 1s real and symmetric around 7, we see from ( 66 ) that a smoothing of the IPS 
in the frequency direction will result. 


IPS in general can hence be written as 


* 


IPSt, = + ES U- D+ (st + D] wO e Lar, (85) 


ASA 


where w(t) 1s the chosen window. 


To illustrate the effectiveness of the windowing operation, we computed the IPS of 


the test signal used in Figure 3. A Hamming window was used in ( 85 ), and the results 


are shown in Figure 5. As is observable, the ringing is avoided. 





Figure 5. IPS fora linear chirp - Hamming window 


B. PROPERTIES AFTER WINDOWING 
Some of the properties of IPS will be affected by the windowing operation. It will 
thus be necessary to deternune which properties are modified and establish the con- 
nections between the characteristics of the used window, and the effects it generates. 
Tracing back the windowing operation to the kernel function (@(v, t)) in ( 20 ) that 
it implies, we will be able to use well established results [Ref. 14] about these functions. 


The resulting kernel function will be 
D(v, 7) = u(0)w(7) cos(ruz), (86) 


which can be proved using ( 19 ) as follows : 


IPS, f) = | | | u(0)w(7) cos(rur)s(1, + r r= ET elt gp PRON udrdr, . (87) 


090 MN DOMO) 
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Replacing the cosine by complex exponentials, we have 


ESOS O) 


“u(0)w(7 : a E ER 
NE |) -| | A s(t, teas (1, ÓN > die e 


a ” (88) 


(0w 
+ oa y kata s(t, + as 


—00 —OO -00 


* 


e a ae A 
US A PROD dr 


and, by convenient change of variables, 


(0) i ON EA 
IPS(t, f) -| | a Rie dhe di 
ca "N (89) 
— EO - s (u)s(u + rje dpe PEN ar i 


Interchanging the order of integration, we obtain 


IPS(t. f) = [s()s (1 —t) +5 (fs(t + YJ Ole ar. 


A 
2 


A, 


completing the proof. 

We can now easily determine which properties are maintained after the windowing 
operation, by direct application of the results in [Ref. 14), mapping the characteristics 
of the kernel function to the properties of the distribution. The conditions on D(v, 7) 


necessary for the preservation of each of the properties will also be given. 


(4) 


. Lime shift 


If œ(t) = s(t — la), then 
IPSE S= IPS =D 


The requirement onk oir S 


D(v, 7) constant for all t. 


This property will thus be maintained for every used window. 


Modulation 
If w(t) = s(ne?%', then 


IPS mn) UBS — Lo) - 


The requirements 
@(v, t) constant for all f. 


This property will thus be maintained for every used window. 


Marginal in Time 
| IPS(t, fldf=|s(a) |’. 


The requirementson (ys 2) as: 
D(v,0)=1  forall v. 
Maintained iff w(0)= 1. 


Marginal in Frequency 


| IPS(t, Jdr=|S(f |’. 


The requirement on D(v, T) is: 
(0,7) =1 forall. 


Not maintained for practical windows. 
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(91) 


(92) 


(93) 


(94) 


(95) 


(96) 


(97) 


5. Realness 
IPS(1, f) is real for allz, f. (99) 
The requirement on (v, 7) is: 
@(v, t) =O (—v,-7). (100) 
Maintained if the window is real and even, or at least possesses conjugate symmetry 


around the origin. 


6. Instantaneous Frequency 


| JIPS, Naf 
E = f(t) (instantaneous frequency) (101) 
st) | 
ic requirement on D(b, 7) is: 
D(v,0)=1 and ae M(v, t) l-o = for all v. (102) 
Maintained if 
OE 
e w(t) [9 = 0 


7. Group Delay 


| SU dr 


= =T(!) (group delay) (105) 
ISI i 
meme q imement on Di, +) 1S: 
D(0,7)=1 and a @D(v,t)|,-9=0 for all c. (104) 
Not maintained for practical windows. 
S. Zero Power 


This property is clearly maintained, due to the multiplicative nature of the win- 
dowing operation. 


S) =0 = IPS(t, fo) =0 (106) 


This property will not be maintained since, from ( 64 ), a convolution along the 
frequency axis will be performed. 
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C. SPECTRAL RESOLUTION 
|. Time-invariant Components 

Though IPS is a tool directed towards the analvsis of signals with time-varying 
spectral contents, some insight into its inner workings will be gained by considering its 
resolution capabilities for time-invariant spectral components. A comparison of resol- 
ution capabilities for IPS and WD will be made, and contrasted with the standard 
Fourier Transform. 

As seen in ( 6 ), IPS is a coherent average of two terms, one of which uses only 
past data, while the other uses only future information.* When analyzing finite duration 
signals, the “past term” will have its maximum resolution capability at the end of the 
data segment, since it is at this poimt that the most past data is available. Similarly, the 
“future term” will resolve better at the beginning of the data segment. This can be seen 


in Figure 6 and Figure 7, for the past and future terms, respectively. 


Contour plot 
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Figure 6. Sinusoidal component. Past term. 3-D (a) and Contour (b) plots 


If the two terms are averaged as in ( 6 ), the resulting expression (IPS) will keep the good 


resolution at end-points that each component provides, degrading its resolution 


* A note of caution should be made here. These “past” and “future” terms should not be 


identified with the two terms in ( 12 ), since each one of the terms in this formula spans all the data, 
from — on to o». 
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capabilities towards the center of the time segment. This effect can be observed in the 


contour plot of Figure 8. 
Contour plot 
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Figure 7. 
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Figure 8. TIPS for a sinusoidal component. 3-D (a) and Contour (b) plots 
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This is one of the characteristics of the unwindowed IPS. It has better resolution at 
end-points than it does in the middle of the data segment. We should point out that the 
WD does the opposite, presenting a thinner main-lobe at the center of the time interval, 


and loosing resolution capabilities towards the end-points (Figure 9). 


Contour plot 
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Figure 9. WD for a sinusoidal component. 3-D (a) and Contour (b) plots 


A more formal discussion can be made as follows: Let us denote by CUY, (7) the 
function of 7 whose Fourier Transforin is WD(t,, f) and, similarly, denote by Cf, (7) the 


function of t Whose Fourier Transform is /PS(t, f). Hence, 


E 


CH, (1) = 5 lo — > slo +>) 


2 
(107) 


Chit) =F Estto)s (to — 2) + 5 (to)s(to + 7)] 


where s(t) is the stationary signal of interest. That is, both IPS and WD create, from the 
data, a new “signal”, whose Fourier Transform is the wanted spectrum (Figure 10). It 
is thus the effective duration of this new signal that determines the resolution obtainable 


in the frequency domain. 
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Figure 10. {IPS and WD: conceptual diagram 


Let us now assume that s(t) is only known within a finite segment of length L, 
and compare the theoretical resolution of JPS and WD at both the end-points and the 
middle of the segment. 

a. End-points 
Since both end-points are treated sinularly, we will only consider the start- 
ing point of the known signal, which we will denote by 4. Let us consider that a 
rectangular window of length L was applied to the signal ( Figure 11). Itis easily seen 
from ( 107) that 


O tTHO 
CW, = 2 (108) 
o ist t=0 
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and thus, HD, (/) (Fourier transform of a delta function) is a constant. On the other 
hand, the windowing of the data implies that a window ( w;(1) ) of length 2L ts applied 
to C/,(t) (figure 12). 

wea T O 





Figure 11. Window applied to the signal 





Figure 12. Window applied to C/,(r) 
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When we now take the Fourier transform of C/,(z) , we will be taking the transform of 
a signal twice as long as the original data. This creates a main-lobe twice as narrow as 
the main-lobe that would be obtained if we were to Fourier transform the data. How- 
ever, we know from ( 107 ) that Cl, (7) = CI (—7). This then implies that, from the 
obtained segment of length 2L, only half contains information. As a result, at end- 
points, IPS presents a main-lobe twice as narrow as the one obtainable by Fourier 
transforming the data, but without an equivalent improvement in effective resolution. 


b. Middle-point 


We now consider that the window is symmetrically placed around h, as in 
Figure 13. 





Figure 13. Symmetric window applied to the signal 


Again, it is readily scen from ( 107 ) that windowing the data with a window of length 
L centered at 4 implies the windowing of CIV, (7) with a window of length 2L. The eflect 
on Cio), however, is to window it with a window of length L. This discrepancy implies 
that, in the middle of the interval, the WD has twice the effective resolution of IPS. Let 
us note that, as discussed before, for both CHV,,(z) and C1,(r), only hall of the length 
carries information. Hence, their Fouricr-transform will have its main-lobe artificially 
narrowed by a factor of two, without improvement in effective resolution. In summary, 
the WD has at the middle of the time interval the same effective resolution that IPS has 
at end-points, and hence the same effective resolution obtainable by Fourier- 


transforming the data. 


a7 


This fact of IPS having twice as poor resolution m the middle of the time 
interval than at end-points will be important, smce the windowed implementation 
normally uses the window centered at the time of interest. This then means that these 
implementations are using, at each time, the Worst estimate (in terms of resolution) that 
IPS will provide. The model of a typical windowed implementation of IPS can be seen 


in Figure 14. 


O Y 
ERES 





Figure 14. Typical implementation of IPS 


The overall result is that, for these implementations, IPS has, by a factor of two, lower 
effective resolution for stationary components when compared with the WD. 
2.  Time-varying Components 
One of the most noticeable characteristics of IPS is the fact that the width of its 
main-lobe depends on the dynamics of the signal. A typical example can be seen in 


Figure 15, where a quadratically chirped FM signal is used. 
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Contour plot 
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Figure 15. IPS for FM signal - Quadratic chirp. 3-D (a) and Contour (b) plots 


This test signal has its instantaneous frequency increasing quadratically with time. The 
width of the main-lobe is directly proportional to the slew rate. In Figure 16, a cross 
section along the frequency axis of IPS for the same test signal is presented. The past 


and future terms are also indicated. 
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Figure 16. IPS for FM signal - Quadratic chirp. Cross-section 
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As is seen, the width of the main-lobe of IPS results from the imperfect alignment of the 
two terms: the past term seems to lag the true position of the instantaneous frequency, 
showing a tendency to be positioned in locations previously occupied by the signal, while 
the future term: leads the true location of the instantaneous frequency. This lag and lead 
effect become more severe for faster dynamics, hence broadening the main-lobe of IPS. 
In an attempt to explain this feature, one might be tempted to argue that IPS 
lacks symmetry in its definition. Despite being clearly insufficient, as we will see, it will 
be instructive to pursue this type of reasoning a little further, since some additional re- 
sults will emerge. When computing C/,(t), we are, for each t (without loss of generality 
assume t >0 ), coherently averaging two terms. One results from the present and past 
signal history (x(1)x(t + 7) ), and the other results from present and future signal history 
(dol +7)). To obtain I[PSiwe rill extract the frequeme, contents o| the sequenecmsc 
obtained. However, if the signal has frequency dynamics, each one of the terms will 
contribute differently to the frequency contents of CI. (7). For example, if the instanta- 
neous frequency of the signal is increasing in time, the contribution of the present-future 
term (x (Dx(1 + 7) ) will have higher frequencies than the present-past term, since it 1s 
centered in a region with higher average instantaneous frequency. The temptation thus 
arises to compensate for the difference in the centers of the two regions. This effectively 
corresponds to forcing the two terms in Figure 16 to align “correctly”, thus eliminating 
the broadening of the main-lobe. To do so, it suffices to introduce compensating phase 
terms in the process of computing IPS (see . Alternate way of computing the Wiener- Ville 
below). When these compensating terms are introduced, the distribution that results is 
the Wigner-Ville distribution. We can now interpret WD as: the WD is the distribution 
that results by aligning the two terms of IPS. 
o = Alternate way of computing the Wigner-Ville 
From the previous discussion, an alternate wav of computing the WD can 
be derived, which allows us to directly apply the WD to real signals sampled at the 
conventional Nvquist rate, thus avoiding the need for oversampling that would be 


required if the WD was to be computed in the usual way [Refs. 7, 10]. The two 
step procedure 1s: 


1. Compute 


F(t, v) = Esos e de es (Os ae Je at (109) 


A 
2 
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2. Compute 


WD, = F(t, ve PD andr (110) 


II O 


As can easily be recognized, the need for a 2-D Fourier transform places the 
cost of this procedure well above the one needed if an interpolation scheme is used. This 
consideration may disappear in emerging fields such as optical signal processing. 

All our previous discussion has been based on a loosely defined lack of symme- 
try, inumately related with the concepts of past and future. As pointed out earlier, this 
explanation is clearly insufficient. To illustrate why, a different definition of “Instanta- 
neous Power Spectrum” is made in Appendix E, thus creating a new distribution. The 
important point to be made is: not only are the concepts of past and future inexistent 
in this new definition, but it also possesses perfect symmetry. However, the resulting 
distribution 1s very simular to IPS, with the characteristic data-dependent width. Since 
lack of symmetry cannot be argued for this definition, the explanation for the broaden- 
ing of the main-lobe must be found somewhere else. 

In Appendix C, the issue of uncertainty 1s addressed. One important result to 
extract from there is: the maximum obtainable frequency resolution when analyzing 
signals with unknown frequency dynamics is Jdfjdt. This result 1s in agreement with the 
fact, proven by Rihaczek [ Ref. 4], that signals with strong phase modulation have, at 


each ume. their energy concentrated within a frequency band B, of size 


E (111) 





where (z) is the signal's phase. It thus explains the broadening of the main-lobe of IPS 
in terms of the expression of uncertainty. Since the uncertainty region grows for in- 
creasingly faster dynamics, so does the width of the main-lobe. Put in simpler terms, “the 
faster it moves, the harder it is to locate”. According to the results of Appendix C, any 
algorithm should behave in this manner, unless it has, or assumes (directly or indirectly), 
information concerning the dynamics of the signal. However, mot assuming is the key 
rule for a robust algorithm, one whose performance does not depend on the class of 


signals being analvzed. 
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D. THE CHOICE OF A WINDOW 
We know from ( 66 ) that. if we pre-window the signal with a window real and 


symmetric around the time at which we are evaluating IPS, the resulting IPS will be: 


IPS E |) IPS (o IPS Os (112) 
But 
PS ij > ONGS o NT 2 ide (113) 
and, since w(t — 7) =w(1 + 7), 
BES GR = | w(Dw(t + re Ear 
ed (114) 
= w(t (ner 
= w (OIF () , 


where w (t) = w(1 + 7), that is. w, is the used window shifted to the origin. 

Hence, if a real and even sliding window is used. the smoothing function in ( 112) 
is the Fourier Transform of the window w(t) . It is interesting to compare this result 
with the Pseudo-Wigner Ville (frequency smoothed Wigner-Ville), where the smoothing 
function is not the Fourier Transform of the window itself, but the Fourier Transform 
of the square of the window (in a rescaled frequency axis) [Ref. 6]. We thus see that all 
the knowledge on window functions for the Fourier Transform can be directly applied 
to the choice of a window to be used with IPS. 

Though the smoothing function in ( 112 ) is always real (we are considering only real 
and even windows), it may be negative. This can contribute to the presence of strongly 
negative values in the resulting IPS, especially if the side-lobe structure of the window 


is not controlled. This thus suggests that windows with good side-lobe suppression 


should be preferred. Spectral resolution is another important issue to be considered 
when choosing a window. 

As discussed in the previous section (Section 111-C), IPS has a main lobe whose 
width 1s proportional to Jaf lat , in accordance with the uncertainty principles of Ap- 
pendix C. Hence, degrading the apparent frequency resolution of the unwindowed IPS 
does not become a relevant issue, if working with windows whose effective duration 1s 
well above the reciprocal of the uncertainty region. The width of this region should thus 
be the criteria for the choice of window size, but would require a-priori knowledge of the 
dynamics of the signal. In the absence of this knowledge, optimality will not be achieved. 
However, as We will see. IPS is a very forgiving tool, in the sense that its performance 


is not seriously affected even for large deviations from the optimal window size. 


IV. EXPERIMENTAL RESIES 


To illustrate the tvpes of behavior predicted in the previous sections, some test cases 
will be presented. For all the cases the test signals are 128 point sequences, with one 
spectrum for each sample. All plots are on a linear scale. Some considerations relating 
to the Pseudo Wigner-Ville (PWD) were made: all test signals are analytic, thus freeing 
the PWD from the somewhat annoying interferences between positive and negative fre- 
quencies. The exception to this rule will be Figure 19, since its sole purpose is to 
illustrate the effects of the use of real signals. Also, when noise is present (Figure 24), 
analytic noise is used. Unless otherwise stated, all Figures use a 41-point Hamming 
window, and present the distributions after smoothing along the time direction. Hence, 
most of the spectral cross-terms of the PWD due to multi-component signals do not 
appear in the plots. To illustrate the effectiveness of time-smoothing, Figure 17 was 
provided, differing from Figure 21 only in the fact that the time-smoothing technique 


was not used. 


) 


| 


y 
| 
" 
Y 


O 
N 
| 
) 


A 
n 


y 
O 
| 


| 


‘OY 


FREQUENCI AA IS FREQUENCY AXIS 
(a) = (b) 


Figure 17. FSK. IPS (a) and PWD (b). No time-smoothing. 





A. REAL SIGNALS 
In Figure 18, IPS and PWD for a linear analytic FM chirp are presented. 


Comparing Figure 18 with Figure 19, where the test signal is a linearly chirped cosine 
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function, we can appreciate how insensitive IPS is to the fact of the signal being real. 


The artifacts present in the PWD when analyzing real signals are, for all practical 


purposes, absent in IPS. 
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Fieure 18. Analytic linear FMI chirp. IPS (a) and PWD (b). 
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Figure 19. — Linearly chirped cosine. IPS (a) and PWD (b). 


B. INSTANTANEOUS POWER 
Another visible effect in Figure 19 is the “amplitude modulation” of the main-lobe 


of IPS. IPS gives us the distribution in frequency of the instantaneous power of the sig- 
nal. It then follows that, when computing the spectrum at a low power sample, the 
amplitudes of the frequency components will have to be small, since their total power 
must add up to the low power of the sample. This is a direct consequence of ( 95 ) and 
is a requirement that has be fulfilled by any positive distribution. It is not observable in 
Figure 18, since the used analytic signal has the same power at all samples. 

The WD, though also obeying the Marginal in Time property, tends to absorb the 


fluctuations of the instantaneous power in its cross-terms, and, in order to do so, is 


forced to assume strongly negative values. 


C. END-POINT RESOLUTION 
Though the better resolution at end-points provided by IPS is in general well visible 


in all the plots that are presented, Figure 20 is probably one of the most representatives. 
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Figure 20.  FSK (127-point Hamming window). IPS (a) and PWD (b). 
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Considering it, we see that not only is the end-point resolution of IPS better, but also 
the transition times are shorter, allowing a more accurate definition of the time of spec- 
tral jumps. 

As stated in Section III-D, IPS is a forgiving tool, in the sense that we can deviate 
from the optimal Iength of window without severely degrading its performance. To 
illustrate, Figure 21 is presented, which differs from Figure 20 only in that it uses a 


window with different effective duration. 
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Figure 21.  FSK (5l-point Hamming window ). IPS (a) and PWD (b) 


As is seen, the time resolution is basically kept the same, the only noticeable difference 


being the width of the window’s main-lobe. 


D. MULTI-COMPONENT SIGNALS 
Performance for multi-component signals is addressed in Figure 22 and Figure 23, 


where the test signal has three components: a linearly chirped FM signal, a quadratically 
chirped FM signal, and a stationary component. As seen, IPS is free of the cross-terms 
appearing in the PWD. However, for IPS, each one of the spectral components has its 
width affected by the associated uncertainty region and is, hence, wider than the corre- 
sponding one in the PWD. Again, the instantaneous power modulation is visible in IDS. 


These effects are also apparent in the provided contour plots. 
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Contour plot 
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Figure 22. 


Contour plot 
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Figure 23. PWD for Multi-component signal. 3-D (a) and Contour (b) plots 
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E. PERFORMANCE IN NOISE 

One issue still not addressed is the performance in noise. In Appendix D, a proof is 
given that, for a signal embedded in additive bandlimited white Gaussian noise, the 
variance of IPS is, on the average,” smaller than the variance of WD. Also, from the 
same Appendix, we have that, when the bandwidth of the noise approaches infinity, the 
improvement in the Variance given by IPS approaches the limiting value of 3 dB. 
Figure 24 is provided to illustrate this theoretical result. The SNR for this picture is 


5 dB, and tt is seen that IPS does, indeed, present a smoother estimate of the instanta- 


neous spectrum. 


(SS 


( 


Wig 
a Y x 


O 
a H 
w É 
E sil 
E 


TIME AXIS 


FREQUENCY AXIS PREUCENGY AXIS 
(a) (b) 





Figure 24. FSK ( SNR=5 dB ). IPS (a) and PWD (a) 


Pomtlt@CaArruRE EFFECT 
Having borrowed the term from communication theory (FM demodulation), our 


goal is to compare the behaviors of IPS and PWD when analyzing multi-component 
signals, When the components have different amplitudes. In Figure 25, the two compo- 
nents have amplitudes differing by a factor a four (12 dB). As is observable, both IPS 
and PWD detect the weaker component, despite the fact of the cross-terms in the PWD 


being already more energetic than the weak signal itself. In Figure 26, the same two 


Se ! ae ; ; 
The variance of IPS is time dependent. When we say that ”...the variance is...on the aver- 
age....”, we are referring to the time average of the vanance. 
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component signal is presented, now with a factor of 8 between amplitudes (18 dB). It 


is obvious that the PWD lost the weak component. The cross-terms, however, still re- 
main. In this case, the position of the weaker term could be inferred from the cross- 


terms, but such a technique would be impossible to apply to more complex signals. 
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V. PARAMETRIC IPS 
A. INTRODUCTION 


We have, from ( 12 ), that the IPS of a signal is, at each time, the Fourier transform 
of a function of the signal. This implies that the wanted information lies in the spectral 
content of this function. The process of obtaining the IPS can therefore be reduced to 
the problem of performing conventional spectral estimation on a function G(r, tT) which 
is derived from the signal as in ( 1] ). The use of parametric methods should thus be 
considered, in an attempt to improve the resolution capabilities of IPS. These consider- 
ations are also applicable to the WD, and a parametric version of this distribution has 
been considered [Ref. 20]. 


B. AR MODELING - CONSIDERATIONS 

Though obtained as a bilinear transformation of the data, the function in ( 11 ) can 
not be considered a true autocorrelation function, since it is not constrained in any wav 
other than possessing conjugate symmetry around the origin. It can not be guaranteed 
to give rise to a positive definite autocorrelation matrix, and hence can not be used to 
directly solve the normal equations. We can however estimate its autocorrelation func- 


® we will be fitting the model 


tion and fit the model to this autocorrelation. In a sense, 
in the fourth moment of the data. In all the results presented, the AR parameters are 
obtained using the Modified Covariance Method. This method has been found to pos- 
sess good resolution properties, while alleviating some of the problems associated with 


the Maximum Entropy Method (MEM) (Ref. 21]. All plots are on a logarithmic scale. 


C. EXPERIMENTAL RESULTS 

Choosing the order of the model is a sensitive issue. From Figure 16, we know that 
the main lobe of IPS consists of two terms, which do not align exactly. We might thus 
expect that, if the correct order is exceeded, the extra poles will tend to resolve these two 
terms, and each spectral component will have two associated peaks in the spectrum. In 
Figure 27, where the test signal is again a single component quadratically chirped FM 
signal, this effect is well visible. In the left, a one-pole model was used (analytic test 


signal). With the addition of one extra pole, we can see that the two terms were resolved 


£ Lacking both temporal and statistical averaging. ( 11 ) can not be considered to represent the 
second moment of the data. 
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(with some bias), and the undesired effect is now present. In Figure 28, a fourth order 


model was used to fit a three component signal. 
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Figure 27, Parametric IPS. First (a) and second (b) order models 


We can observe that, since the main lobes were not over-resolved, the signal components 


are Well defined, and can be located with greater precision than what would have been 


possible with the non-parametric IPS. 
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Figure 28. Three component signal. Parametric IPS. Fourth-order model. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 


In this thesis, an alternative tool for the spectral analysis of signals with time- 
Varying spectral content is presented (IPS). Having its roots in the definition proposed 
by Page of “Instantaneous Power Spectrum”, IPS has a defining expression very similar 
to the one defining the Wigner-Ville distribution (WD). Their performance is, however, 
considerably different. The WD approach, though extensively used in the last decades, 
suffers from a number of short-comings. The foldover frequency of its discrete fre- 
quency is located at z/2 , requiring the use of analytic signals or the use of an interpo- 
lation scheme as an alternative to sampling at twice the Nyquist rate. Interference when 
used with real signals, cross-terms when multi-component signals are present, preference 
of linear dynamics and loss of resolution at the extremes of the analysis segment are 
other problems associated with the WD [Ref. 8]. The distribution presented here 1s 
shown to have some advantages over the WD. These are: evenness of performance for 
different dvnamics of the signal, direct applicabilitv to real signals sampled at the 
Nyquist rate, reliability when analyzing multi-component signals. and preservation of 
spectral resolution at end-points. Also, IPS is shown to behave better in the presence 
of additive white Gaussian noise (AWGN). As a disadvantage, IPS does not achieve the 
frequency resolution provided bv the WD. Due to its robustness, IPS 1s especially well 
fitted to be used as a front-end tool in non-stationary spectral analvsis. The usefulness 
of AR modeling as applied to IPS is investigated. It 1s found to be a technique very 
sensitive to the chosen order model, but providing some advantage in defining the lo- 
cation of each spectral component. The use of other parametric techniques should be 
investigated. In Figure 29, we can see the FSK signal of Figure 20, tlus time processed 


with the following expression: 


De, =+ | ss" —D— ss + D]e PM ade. (115) 


That is, Figure 29 is the imaginary part of the Rihaczek distribution of the test signal. 


As is seen, it shows some natural ability to detect fast transitions in the spectrum. and 


seems to be a promising area for future work. Also, linear combinations between IPS 


and this imaginary part of the Rihaczek distribution should be investigated. 
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Figure 29. Imaginary part of Rihaczek's Distribution - FSK 


Signal synthesis from IPS may also be a promising field of research for speech processing 


applications. 


APPENDIX A. IPS FOR FINITE DURATION DISCRETE SIGNALS 


In this appendix, a formal derivation of the expression defining IPS for discrete 
signals ( 16 ) is presented, as an alternative to the direct discretization of ( 12 ). 


The discrete version of (3) is, assuming that the sequence starts at n =0: 


S (n, 0) = ar) s(mjel" (116) 
ns 
Hence, 
É I 
PSR O) [2 = STS so a” ys s (n) ~ 
n=0 F (117) 
i 
Se: — —j0k 
= ss Ce 
k==1 
where 
i 
Cp = > s(m)s (a =k) k20 (118) 
n=k 
and 
ca (119) 
Similarly 
N=] 
S*(n, 0) = st) me” (120) 
n=I 


= 
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Sn, 0) = ary s(nje > s (ne 
n=1 
Ei (121) 
(N=1-1) 
o y on ae 
=—(N—]—7) 
where 
N=1 
c; = > s(ms (n=k) k20 (122) 
n=It+k 
and 
q =(c). (123) 


Now. to get the digital equivalents of (2)(4) we must approximate the derivatives of 


( 117) and ( 120 ), which we will do using first order differences: 


07(n,0)= AT[ 150.0) P—157—1,0)1?] 


k=-I 
where 
e, =s(n)s (n—k) k20 (125) 
and 
cx = (8) (126) 
In the same way, 
p*(n, 0) = AT[ | S*(n. 8) |? +) S74 1,6) 17] 
sê a a yo (127) 
k=AN=1—1) 
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where 
EN = s (n)s(n +k) k>0 (128) 
and 
O (129) 


Assuming the signal to be zero outside the known samples, then according to Levin's 
definition (6): 


IPS(n, 8) => Lo (1,0) + 0*(n, 0)] 


ns 


130) 





= Al > Es(n)s (n — k) + s (n)s(n + lcm 


K=—00 


which is the wanted result. 


APPENDIX B. INSTANTANEOUS FREQUENCY AND IPS 


The proof of property 6 ( 55 ) can be made as follows: 
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Therefore 


00 


| fIPS(t, fdf 


+ Jmagls (os (0)] 
Is) Ot fst) 


For real signals,this center of mass is zero, as expected, since the spectrum is conju- 
gate symmetric about DC. 


For analytic signals: 
s(t) = m) eP (134) 


and 
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“eee” 


and, finally 


I 


£0 == en] (136) 


which is the definition of instantaneous frequency for analytic signals. ' 


“ This definition of Instantaneous Frequency was introduced by Ville [Ref. 2] in 1952. 


5 


Hence, we have that, for analvtic signals 


| iP Seana 


eee 


and our proof is complete. 
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(137) 


APPENDIX C. NON-STATIONARY SIGNALS AND THE PRINCIPLE OF 
UNCERTAINTY 


When analyzing signals with time-varying spectral contents, there is a maximum 
obtainable frequency resolution, determined by the dynamics of the signal. This is in 
contrast with the stationarv case, where the only constraint on the obtainable resolution 
is the observation time. The presentation and proof of this effect is the goal of this 


chapter. 


ae [HE PHYSICAL CONCEPT 

Before presenting anv mathematical results, we will try to approach the subject with 
a brief discussion of the involved ideas. 

Let us consider our ability to perform spectral estimation on a stationary signal. 
For that purpose. we will assume that the incoming signal has been segmented into 
contiguous intervals of Ar seconds. Our final goal is to determine the spectrum occupied 
by the signal at roughly the moment of analysis, that is, within the last received segment. 

If we are given onlv this last segment, the obtainable frequency resolution is con- 


strained by [Ref. 1] 


Af> —— (138) 


But if now, for some reason. the next to the last segment of data becomes available. 


we will be able to improve our analysis of the last segment up to a resolution of 


Af> —— (139) 


We can continue the process and, if more segments of data become available, im- 


prove our spectral description of the signal at the present time. The rule in this case is: 


e The stationarity of the signal allows us to improve the analysis of the present time 
by using data collected no matter how distant in the past. Hence, if infinite obser- 
vation time is allowed, infinite spectral resolution can be achieved. 
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Let us now consider the same scenario, but with a non-stationary incoming signal, 
which, we will assume, can be considered stationary within the segment. If only the last 


segment is available, our frequency resolution is again constrained by 





(140) 


and the only way to improve it is to consider more segments. But a qualitative difference 
appears at this point. Segments of the “distant past” are rendered useless by the dy- 
namics of the signal unless we know or assume its behavior. Considering those segments 
will only degrade our spectral analvsis. The usable data is thus restricted to the most 


“recent past’, Which implies an upper bound on the obtainable resolution. 


e Fora non-stationary signal with unknown dynamics. there is a maximum obtaina- 
ble frequency resolution, which depends on how distant the unusable “distant 
past’ is. It is an absolute maximum, and depends only on the dynamics of the 
signal. 


Proving this statement is the purpose of this Appendix. 


B. MATHEMATICAL FORMULATION 
Let us briefly discuss Gabor's result. Gabor [Ref. 1] defined the “effective duration” 


Ar and the “effective frequency width” Af of a signal w(r) by the following equations 


Ar=./2x(r— i) 
(141) 


Af= yj 2x(f-fy, 


where [.] stands for the average. and the » moments are defined, ommiting the argu- 


ment of W(t). as 
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resulting in 


ArAf > (143) 


lt 
7 
This only states that any signal whose effective duration is Ar ( see Figure 30 ), will 


have an effective bandwidth Af ( see Figure 31 ) greater than sã ; the issue of uncer- 
tainty is not addressed in this result. 
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Figure 30. Effective duration of s(t) 





Figure 31. Effective bandwidth of s(t) 


In fact, its relevance as an uncertainty principle only comes into play when we realize 
how classical Spectral Estimation 1s achieved. 
Classical methods are based on the “measure of similarity” between the data and 


more elementary functions of time which have strong affinities with the physical concept 


63 


of frequency. The best example is the Fournier Transform, which conceptually is a bank 
of correlators, trying to determine the “measure of similarity” between the data and 
sinusoidal functions, the true representation of our concept of frequency. Other typical 
examples are in the works of Gabor [Ref. 1] and Priestley [Ref. 22], using elementary 
functions on the t—f plane, where these functions have what Priestley calls an 
“oscillatory form” needed to establish the correspondence to our notion of frequency. 

Now, if the data is of length 7, the elementary functions can have at best the same 
duration, which implies that their frequency width is greater than +. Since the ob- 
tamnable frequency resolution 1s at best of the size of the frequency width of the ele- 
mentary functions, the principle of uncertainty in Spectral Estimation follows. 

Gabor’s result is only concerned with stationary signals, or at least assumed to be 
stationary, since Af is the effective width of the Fourier Transform of the signal. How- 
ever, some implications can also be derived for the non-stationary case. 

Let us assume we have an arbitrary stationary signal y(r), which we will consider to 
be analytic to simplifv the formulation, with effective duration Az, effective bandwidth 


Af and constant instantaneous frequency. Formally, 


Id ' z 
a [ are[yw(1)] ] = constant (144) 


Let us now force the signal to linearly chirp in frequency, by multiplying it with the 


phasor e'*” and defining the modified signal y u(t) by 
OO (145) 


where k is a positive real number. Due to the exponential, Y (1) 15 now a non-stationarv 


signal chirping linearly in frequency with a slew rate of 


ie (146) 


Cl 


The following results can be established for w,,(z): 


e Effective time duration 


The effective time duration will be 


Alyy = re ae final (147) 


64 


where 


Ence, 


Effective bandwidth 
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we have that 
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| Y a | y wkidt 
ol Do Seo o 
ojo — (154) 
Y ydi y ydr 
+ d + 
| Ww q | y widt 
nn l AD, =v 
o 2) e a dies 
| W wat | Y ydi 
=f+ki. 


Hence, all we are missing to complete. detemuine Ay 71s the expression ie a 
Mere Ne 











diy dy 
dt dt 
PRA 1 ano E 
E Tn, Wes 
(Ar) (22)? oO ( ) 
WW yet 


Noting that 


diy, diy db db a dy | 
di aaa = ERY at ss 


+ 


dy 
dt 





+(2nky ru, (156) 


66 


we have 











o a — j2nkty AE d janky Z + (22k) yy jd pa 
a lo to 
Vai) = (22) E 
| Ww wat 
i a . (157) 
E Pd | Wt ~ di | Ea 
E a dk lo Kto 
(tt ta E on | 
| ae | Y ydi | y wat 


Considering now the following identities 











dy Wy d E 
7 SRA RU ae yy E E 
„dy YY d i 
Vi = E [Iv TA jay E [are(p) 1 
Me lave 
» d 
| Yop ~y Caret) Jai 
PA = 09 120) + £ =——_——— | (159) 
| Y ydi 
and, using ( 144 ) we get 
(2) = (+P) + 2856. (160) 
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Since [Rete] 


YEN dt 
i alge ek (161) 
| Y ydi 
we have, by ( 144 ) that 
| ww di 
le (162) 
| Y yd 
From where, 
A- DEE ) + 2kif. (163) 


Hence, from ( 151 ) and ( 154). 
Afif = 27] (1) + 440) + nif | —2n[ FP + GY + 2x7] 
=2 (19 +e 1 (?)- 0 | (164) 
= (Afy + KA). 


Therefore, using ( 150 ) we can write the effective time and frequency widths of the 


modified signal w,,z) in terms of the effective widths of the original stationary signal 


v(1) as 


(Ah! = (A + AD”. 
mi = (Ar) : 


(165) 


Using ( 143 ) and ( 165 ), we get the result 
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(Afy At) = (A0 [ (AN! + «9407 ] 





= (AA + K (AD (166) 
Lian 
2 7 +k (Ad , 
from where 
(Nga E +k (An. (167) 


Hence, if we minimize the right hand-side of ( 167 ), we will find a lower bound on the 


effective bandwidth of the chirped signal. Proceeding with the minimization, we have 











x > (At) |=0 = 7 +2k (Ar) =0 
CALL (At) A(A1) 





Hence 
(169) 


and we can read the obtained results as stating: 


Lemma l. A linear chirp will always occupy an effective bandwidth greater than 
SES ” . . . 
af [dí , independently of its effective duration. 
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Lemma 2. A linear chirp will occupy the minimum effective bandwidth of /df/dr , 


onlv if its effective duration 1s 
= 
dfi 


= 


AG 


These results will have obvious implications when classical spectral analysis of the 
chirped signal is attempted; no matter what the duration of the elementary functions is, 
the signal will always appear to occupy more than ./df/dt , and this will thus be the best 


obtainable resolution. This completes our proof. 


Lemma 3.The maximum obtainable frequency resolution when analyzing signals 


with unknown frequency dynamics is ./df/dt . 
Though the previous discussion was restricted to the case of a linear chirp, the 


underlving ideas remain valid for laws of variation other than linear; the treated case can 


be considered as a first order approximation for arbitrarv non-stationary signals. 
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APPENDIX D. PERFORMANCE IN NOISE 


As we will see later in this Appendix, an absolute evaluation of the performance of 
IPS in noise is not possible, because the second moment is data dependent. We will thus, 
instead, compare the noise performance of IPS with the noise performance of the 
Wigner-Ville distribution. Let us consider the inverse Fourier transforms of IPS and 
WD. 


CI(t, T) = > Es(Ds (1 — T+ s (t)s(t + 7)) 
versus (172) 


7 = ee nl 
A) E y )S (1 7)» 


Given a signal z(r) that is corrupted by WAGN, where the noise is analytic and 


bandlimited to the maximum frequency (HW of the signal, thats: 








(O) = z(t) + n(t), (175) 
where 
E No O< w= HK (173) 
ni) = O otherwise 
o FF 
No sin( E ) j Wir - 
Ge) = = A (175) 
2 
A. MEAN 
l. WD 


CIV (i t) = x(t + A ro- z E | 2(¢ - Er )+n(t+ T EM — ol E F | 
2 


Esaf E MD A a ==>) (176) 
E 


mal 


E(CW (1, 1)} = CWA. 1) + 20+ Efn" = y 
i (177) 
+2 (= SDEjn(t+ (=. 


Hence 


E(CW(1, 2)} = CIVA2, 7) + Rolo). 
2.0 IPS 


C(t, t) => [ v(i) (t — 1) +v (v(t +2) ] 


de 
2 
= [2(1) +n()]} (0 7) +11 2)] (179) 


+S O HOJE- 1) + nl 9). 


ECA) = A(z e- o) A Efan- o) 


to |— 


+ 2 (elt + 1) +A Efn (nl + o) (180) 


Senne 5 ON 


att 
2 
Hence. 


E{CI(t, 1) } = Cl{t.t) + R(t). (181) 


Therefore, IPS and WD have, as far as their first moment 1s concerned, identical be- 


havior in the presence of noise. 


B. VARIANCE 


|. WD 
[CO e > + y) 
= 20 +A O 


| CWC, 2) P= [z(t + > )z (1 Es 5) + 2(1 IS — E 


a (1 SS 





- 
$ Ñ T ll gg ref i (183) 
2 Ur AU )+z O 
eed) aa ro =). 
Using now the expectation operator, we have: 
ELL CW, Dl} = 2+ SYP Lee) P 
++ E -Ein 
++ PE ae- 
+20+ 3) En (+ 1-5) 
, A (184) 
ar (>) + En tt a) 
+) PE y 
+ Ed re + E Ja (1 — E ieee a \n(t — = ) 
+ (+ St nl +) )\ | 
And, since? 
En a+ Sp = Ellos He E ) 0. (185) 


N 
A MS E 
+ |2(1+ > ) | z +Z (1 + 5 Naty IR, (7) 





2 
a | All eee 





i See Appendix F. 
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where we made use of the fact that, for zero mean Gaussian random variables 


(Ref. 23], 
En 
o Elxxxx) — Elm) E (xa) do El) EA oe a E 
Hence, 


To T N T 
a PY 1d DIS ari a] 





2 

+ RAS A qu 
¿AN 
+ |R,(t) | + 7 ). 


Now, 
Var{CI,(z, 1)} = Ef | CIP 2) P} — ELC Le, DELCW 0) 

E$ | CIF (1, 1) P} — ECW, 1) + RADIO, 2) + Rito) 

Bake E SR C= Sasol F 


188) 


— CIV (t. t) R(t) — CH (a, DRT). 


From where. using (ISA 





E x 2 
NTF NAT 
Var[CIV (1. 1) = | | A Pa lee F P | a ( a (189) 


Zoe LBS 


Ch, 7) P= LOr (¢-—a tv Ov + Oly (Ove — o) + do (+) 


al 

4 
+ Dor (e — are — 1)] + 4 [v (Drle + t) (Dur — 1)] (190) 
+ Or E DA E O o). 


For simplicity, we will treat each term in ( 190 ) separately, and use the fact that 


the second and third terms are complex conjugates of each other. 
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First term 


a A Did (Dv ( — T)r(1 — 7)) } — 


+ + = + E! E D|- a 


El2(0 + (019 + (DIU) +1 (1 Dele — 1) + (1 ))) 

y Efl) (D) +20 (0) + znl) + nfn] 

“(1 1) MAA (191) 
z(t— t)n (t= 1) +n (t — t)nlt — o] 

r [e(z (i)z l- i)z — 1) + | z(e) PELI ne — 1) 1°} 

202 (1 DEN (On 0) + 202 — DE On l a) 

| 2(¢ = E 1 no PY + Elton (On: — Jn (e). 


Simplifving, 
Es = [voy (dv (t— tr - DJ} = + | 2(2)2"(¢ — 2) |’ 

oia 
27 | 


No lt’ a 
Z KORC — DRM?) + Ei - ) + + RR IR 


QT 





- 0 P+12009 P+F 2070 DR) (192 
E 





E als L| 


+ 


Second term 
Eta or) b = 
E) + (DIRA + o) ++ DI) + (Ot 1) + (1 1) ] 
[2 (Ðzlt + 7) +2 00 + 0) +02 +5) +12 (0n(1 + 2)] 
{e (Hz 0 +2 (0n(1— 7) +1 (0920 — 2) + (On(1 — 2)] 
+ 0 (d+ Da) +2 (02 + DEn (0n(1 — 2) 
+ 2 (Oz(t— DEfnçe + on (bz a+ DE (Oni — 7) 
+2 (00 DEN (On(1+ 0) +Eln (Da (On(1 + o)n(e — 7))]. 


El = [y (eet t)r (Dvr of = + z (t)z (t)2(t + t)2(t — 7) 
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e Third term Since this term is the complex conjugate of the second term, we 


have: 
E + E) (to oh a + doado (+ dz (> 2) 


++ Arye O + (OR 


(195) 
+ (oz “(t+ DRT) +52 2(0)2 (t — t)R,() +> ee 
e Fourth term 


i TOt e+] h= 


Dn (DM ++ DIO aO tn e )]) 


RM e a 


“(1 +0+ 200 (0+7)+n(07 (1 + )+n(on (1+7)] 


Elle 
Effz (z(t + 1) +z (Dn + 2) +n (Nett) +2 (On(t +7) 
Ee 


( 
E (ze(r + Tje z(t e t)Efn()n (t+ t)} 
| 2(2) PE | nfe + 1) °} + | 2(e + 2) PEL | nz) |} 

(De (+ En (On(t+ 0) + En (on(on(+ On (1+))). 


«e 


Es 
4 
+ 
+: 


E + Dto + Orr (+ on -= = lz (f)2(8 +t) |’ 





1 AGI ; Tae 
A O AO 
Los AAA 

q Et) + DRM) + P ) + q) Ro). 





Adding the four terms ( 192 ), ( 194). ( 195 ) and ( 197 ). we thus obtain the 


expression for the second moment of IPS. Now, 


Var{Ci( 0} = Bul, ole Peter an (198) 


where 
E(CI At, DI E[C1 (1, DY =[CL (0,7) + RADITCL(, 7) + Rao] 
= CL, CL, 7) +CL(t, DR (r) (199) 
+ Cia DRA) + | RAT) É. 
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Substituting in ( 198 ), we get that 


Var{Cl,(z, t)} = mero At, 2) If = EACH, D}E(CH (2, DY 
No! 





Sara tao Pla o Pelo Pedais PO 
+ Lact — RCo) +2 z+ DRG) (200) 
+2 (020 RAD) + (de (+ DRA) 


> 2 
EE DD Lie 
ro rl e iz 


“encan non compare the vanances of C/i, Tt) and CH(r, t) . First of alle 


realize that, in both cases, the variance is a function of t and t: If t =0, we have: 








No! No 
Mar CI, t)} a z A z(t t) |? + 27 (201) 
ae nee a 
l Noli , Noll 
Ma CI Ir eN (202) 
That 1s, 
NARICES VAN (== Ma (205) 


for: = 0) we havedronr 2200) that 


NAT’ 
Var{Cl(t, D) < + =3—[ 1200 Pelar 3) 20H 1200 | 


LIROI E-l O+ 


. (204) 
+|z(dz(t-)|+|z(tz (t+) |] 


z 2 
i 1 (Nok 
Por 


a, 


VaríCI(1, t)} < La [aos 1200120 + ale T) 
HADA T) +) OM MEM AO 


NW N 
+1209 [911+ (SE) 


The time dependency of the expression stops us from a direct comparison with 
CH" (2, t). However, since the signal is arbitrary, we have no reason to assume anv or- 
dering of its magnitude at different points in time. We can thus average over time, and 


get the following result: 











Nou’ 
< Var{Cl,(t, D}> a S eo [4< 12) P> ay t < L214 as 
whe ZC) NN Bar) ES E 
GIN (206) 
É 206 
+ < 1x09 11> od + (E> ) 
2 
Nol’ 5 ( Nol¥ 
< T SS ee 27 É 
Using ( 189), 
<VariCiltv> E Natl Glin) ee (207) 
and finallv, we have 
0 TZ 0 
Vari Cl {t, t)} = varn W (208) 
7 
< Var{Ch(t, )}> ay < < | Var[ Ca, y > as (209) 


For the case 7% 0, I'>l, we can make the following approximations, using 
( 189 ) and ( 200 ): 


1/ No \2 
< VarL RE (210) 
Not \2 
< Van I R ee E (211) 
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That is, when the bandwidth of the noise approaches infinity, the difference in 


noise performance between IPS and WD approaches the limiting value of 3 dB. 
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APPENDIX E. A DIFFERENT DEFINITION OF “INSTANTANEOUS 
POWER SPECTRUM” 


In accordance with the discussion in Section III-C, the following definition of In- 


stantaneous Power Spectrum can be made. 


e Definition - Instantaneous Power Spectrum at the time of a given sample is the 
frequency content of the stationarv spectrum implied by the existence of that 
sample. 


That is: reading the Fourier transform as a change of basis in an N-dimensional 
space, the Instantaneous Power Spectrum will be the difference in coordinates (referred 
to the phasor axis) between the location of the signal in the N-dimensional space and its 
location in the (N-1)-dimensional space that results from suppressing the dimension 
corresponding to the sample of interest. 

Since in terms of the internal product, the absence of any dimension can be repres- 
ented bv a Zero component of any of the vectors along that dimension, this definition 


leads to a three-step procedure: 


E Compute 
2 
|A(w) | , (212) 
where 
Xw)= > (meter (213) 
R==00 


2. Define a new sequence, obtained by zeroing x(m) at the sample of interest, that 1s: 


n=Y 


am nee 214) 


where r is the time at which the Instantaneous Power Spectrum Is to be determined, 
and obtain 


LX(w) |. (215) 


3. Obtain the Instantaneous Power Spectrum as: 


2 
Instantaneous Power Spectrum = | A(w) | — | AXw)| . (216) 
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The resulting distribution will be ( using extended sequences, as in Appendix A ) 


| Xw) | = Xlo (0) 


- D x(nje 7". E x (ne 
n=-—-00 
R=—00 
and, after some manipulations, 
OO 
| Xo) e 
Z 
where 
c=) (nx (n-k) k20 
MER 
and 


Correspondingly, 


> OO 
Xe) | = Y bpe 
= 00 


where 
b, = Y xx (nm=k) k>0 
n=k 
and 


Sl 


(217) 


(218) 


(220) 


(223) 


Hence, we have that 


2 i | Se | 
|X(w) | O > bye 


k=—00 k=—00 


= Mean, 


where 
A, =lx(Nx(r=k)+x(r+0x (0) k20 
and 


* 


A_,=A,. 


(224) 


(225) 


(226) 


that is, this definition leads almost exactly (up to a constant, for fixed time) to IPS. 


What is important to realize is that this definition has no concept of past or future, and 


that all points are equally treated, no matter what point they occupy in the sequence. 


This result seems to put an end to our hope of finding assymetries in the IPS concept. 


As a final note, a more formal proof of the relation between this distribution and 


IPS can be made as follows: Define 


x(n) = o a 


IS 


0 eS" 
a Lo n=r 


emcee: 
x(n) = x{n) + d(n) . 


Let 
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(229) 


(230) 


Then, 
X(w) = X{w) + D(w) (231) 
and 


| X(@) | O = (X(@) + Eo D'(w)) — X(0)X (0) 


E (232) 
= X,(w)D (w) + X,(w)D(w) + D(0)D (w). 
Bue know that Diw)= x(1)e +, Therefore, 
| Xlo) É — 1X 400) || = 2RefX (0)x" (e) + Doe (e 
= 2Re| LX) + D(w)] x (re — | x(r) |? 
(233) 


= 2Re[ Nox (ne) — | x(r) | 
= 2Ar1PS — | x) |. 
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APPENDIX F. ON THE SECOND MOMENT OF ANALYTIC SIGNALS 


The purpose of this appendix is to proof the following theorem: 


Theorem: For an arbitrary analytic random signal z(t), defined as 
2(t) = x(t) + j3 (6) , 
where x(t) is the Hilbert transform of x(t), the following identity is satisfied: 


E{2z(n)z(s)} = 0 


Proof: 


E{z(n)z(s)} = Ef (x(n) + XI x(5) +91) 


= E{x(1)x(s)} + jE{x(n)x(s)} + jE{x(n)x(s)} — E{x(n)x(s)} . 


But, from the properties of the Hilbert transform, we have that 


i) ae O) 
slo) (a) 


Hence, 


E{z(n)2(s)} = Rodn— 8) +jRxx(n— s) +jRxx(n — s) — RXX(n — 5) 
=0. 


Our proof is thus complete. 
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